Towards a TEI-based encoding scheme for the annotation of parallel texts
Identifieur interne : 000076 ( Main/Exploration ); précédent : 000075; suivant : 000077Towards a TEI-based encoding scheme for the annotation of parallel texts
Auteurs : Peter Boot [Pays-Bas]Source :
- Literary and Linguistic Computing [ 0268-1145 ] ; 2009-09.
Descripteurs français
- Pascal (Inist)
- Wicri :
- topic : Traduction.
English descriptors
- KwdEn :
Abstract
Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.
Url:
DOI: 10.1093/llc/fqp023
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000320
- to stream Istex, to step Curation: 000320
- to stream Istex, to step Checkpoint: 000045
- to stream Main, to step Merge: 000076
- to stream PascalFrancis, to step Corpus: 000009
- to stream PascalFrancis, to step Curation: 000036
- to stream PascalFrancis, to step Checkpoint: 000005
- to stream Main, to step Merge: 000108
- to stream Main, to step Curation: 000076
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title>Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
<author wicri:is="90%"><name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:C87E170B5B672608C2723A492673F975F0780E43</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1093/llc/fqp023</idno>
<idno type="url">https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000320</idno>
<idno type="wicri:Area/Istex/Curation">000320</idno>
<idno type="wicri:Area/Istex/Checkpoint">000045</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000045</idno>
<idno type="wicri:doubleKey">0268-1145:2009:Boot P:towards:a:tei</idno>
<idno type="wicri:Area/Main/Merge">000076</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Francis:11-0223733</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000009</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000036</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000005</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000005</idno>
<idno type="wicri:doubleKey">0268-1145:2009:Boot P:towards:a:tei</idno>
<idno type="wicri:Area/Main/Merge">000108</idno>
<idno type="wicri:Area/Main/Curation">000076</idno>
<idno type="wicri:Area/Main/Exploration">000076</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a">Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
<author wicri:is="90%"><name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
<affiliation wicri:level="1"><country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Huygens Institute, Royal Netherlands Academy of Arts and Sciences, The Hague</wicri:regionArea>
<wicri:noRegion>The Hague</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Pays-Bas</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Literary and Linguistic Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint><publisher>Oxford University Press</publisher>
<date type="published" when="2009-09">2009-09</date>
<biblScope unit="volume">24</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="347">347</biblScope>
<biblScope unit="page" to="361">361</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">C87E170B5B672608C2723A492673F975F0780E43</idno>
<idno type="DOI">10.1093/llc/fqp023</idno>
<idno type="ArticleID">fqp023</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Computational linguistics</term>
<term>Corpus annotation</term>
<term>Feature structure</term>
<term>Markup language</term>
<term>Parallel corpus</term>
<term>TEI</term>
<term>Text structure</term>
<term>Translation</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Annotation de corpus</term>
<term>Corpus parallèle</term>
<term>Langage de balisage</term>
<term>Linguistique informatique</term>
<term>Structure de traits</term>
<term>Structure textuelle</term>
<term>TEI</term>
<term>Traduction</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Traduction</term>
</keywords>
</textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract">Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.</div>
</front>
</TEI>
<affiliations><list><country><li>Pays-Bas</li>
</country>
</list>
<tree><country name="Pays-Bas"><noRegion><name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
</noRegion>
<name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000076 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000076 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Ticri |area= TeiVM2 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:C87E170B5B672608C2723A492673F975F0780E43 |texte= Towards a TEI-based encoding scheme for the annotation of parallel texts }}
This area was generated with Dilib version V0.6.31. |